Fusion of Children's Speech and 2D Gestures when Interacting with 3D Embodied Conversational Characters
Abstract
Most existing multimodal prototypes that let users combine 2D gestures and speech are task-oriented. They help adult users solve particular information tasks, often in standard 2D graphical user interfaces. This paper describes the NICE HCA system, which aims to demonstrate multimodal conversation between humans and embodied historical and literary characters. The target users are children and teenagers aged 10 to 18. We discuss issues in 2D gesture recognition and interpretation, and the temporal and semantic dimensions of input fusion, ranging from system and component design through technical evaluation and user evaluation with two different groups. We observed that the recognition and understanding of spoken deictics proved quite robust, and that spoken deictics were always used in multimodal input. We identified the causes of the most frequent input fusion failures: end-of-speech management in the speech recogniser, gestures on non-referenceable objects, and gesturing while the character is preparing to speak. We suggest possible improvements for removing these errors and conclude with what the NICE HCA system has taught us about how children gesture and combine their 2D gestures with speech when conversing with a 3D character in such a conversation-oriented multimodal system.
Related resources
Fusion of children's speech and 2D gestures when conversing with 3D characters
Most existing multi-modal prototypes enabling users to combine 2D gestures and speech input are task-oriented. They help adult users solve particular information tasks often in 2D standard Graphical User Interfaces. This paper describes the NICE Andersen system, which aims at demonstrating multi-modal conversation between humans and embodied historical and literary characters. The target users ...
Timing and Rhythm in Multimodal Communication for Conversational Agents
Synthesis of lifelike gesture is finding growing attention in human-computer interaction. In particular, synchronization of synthetic gestures with speech output is one of the goals for embodied conversational agents which have become a new paradigm for the study of gesture and for human-computer interface (Cassell et al., 2000). Embodied conversational agents are computer-generated characters ...
Multimodal Input Fusion in Human-computer Interaction
In this paper, we address the modality integration issue using the example of a system that aims to let users combine speech and 2D gestures when interacting with life-like characters in an educational game context. In a preliminary and limited fashion, we investigate and present the combined use of input speech, 2D gestures and environment entities for user-system interaction.
Children’s Gesture and Speech in Conversation with 3D Characters
This paper deals with the multimodal interaction between young users (children and teenagers) and a 3D Embodied Conversational Agent representing the author HC Andersen. We present the results of user tests we conducted on the first prototype of this conversational system and discuss their implications for the design of the second prototype and for similar systems.
Mental Timeline in Persian Speakers’ Co-speech Gestures based on Lakoff and Johnson’s Conceptual Metaphor Theory
One of the conceptual metaphors that has been proposed is "time as space": time, an abstract concept, is conceptualized through a concrete concept such as space. This conceptualization of time is also reflected in co-speech gestures. In this research, we try to find out what dimension and direction the mental timeline has in co-speech gestures and under the influence of which one of the metaphoric...